Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 8250 |
| Missing cells | 7667 |
| Missing cells (%) | 7.7% |
| Duplicate rows | 85 |
| Duplicate rows (%) | 1.0% |
| Total size in memory | 773.6 KiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 10 |
|---|---|
| DateTime | 2 |
| Dataset has 85 (1.0%) duplicate rows | Duplicates |
OVD_sum is highly overall correlated with OVD_t1 and 2 other fields | High correlation |
OVD_t1 is highly overall correlated with OVD_sum and 1 other fields | High correlation |
OVD_t2 is highly overall correlated with OVD_sum and 2 other fields | High correlation |
OVD_t3 is highly overall correlated with OVD_sum and 1 other fields | High correlation |
prod_limit has 6118 (74.2%) missing values | Missing |
highest_balance has 409 (5.0%) missing values | Missing |
report_date has 1114 (13.5%) missing values | Missing |
new_balance is highly skewed (γ1 = 79.0773819) | Skewed |
highest_balance is highly skewed (γ1 = 47.71863449) | Skewed |
OVD_t1 has 7475 (90.6%) zeros | Zeros |
OVD_t2 has 7886 (95.6%) zeros | Zeros |
OVD_t3 has 7983 (96.8%) zeros | Zeros |
OVD_sum has 7330 (88.8%) zeros | Zeros |
pay_normal has 285 (3.5%) zeros | Zeros |
prod_code has 147 (1.8%) zeros | Zeros |
new_balance has 3864 (46.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-12 01:56:39.346941 |
|---|---|
| Analysis finished | 2025-03-12 01:56:45.785809 |
| Duration | 6.44 seconds |
| Software version | ydata-profiling vv4.13.0 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
| Distinct | 1125 |
|---|---|
| Distinct (%) | 13.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57821730 |
| Minimum | 54982353 |
|---|---|
| Maximum | 59006239 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 54982353 |
|---|---|
| 5-th percentile | 54984182 |
| Q1 | 54990497 |
| median | 58989048 |
| Q3 | 58996551 |
| 95-th percentile | 59003954 |
| Maximum | 59006239 |
| Range | 4023886 |
| Interquartile range (IQR) | 4006054 |
Descriptive statistics
| Standard deviation | 1822724 |
|---|---|
| Coefficient of variation (CV) | 0.031523165 |
| Kurtosis | -1.1676063 |
| Mean | 57821730 |
| Median Absolute Deviation (MAD) | 9099 |
| Skewness | -0.91248548 |
| Sum | 4.7702928 × 1011 |
| Variance | 3.3223226 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 58988212 | 55 | 0.7% |
| 54990497 | 48 | 0.6% |
| 58998646 | 45 | 0.5% |
| 58991343 | 39 | 0.5% |
| 58987276 | 39 | 0.5% |
| 54989251 | 37 | 0.4% |
| 59000307 | 33 | 0.4% |
| 58999208 | 32 | 0.4% |
| 54991742 | 32 | 0.4% |
| 59000510 | 31 | 0.4% |
| Other values (1115) | 7859 |
| Value | Count | Frequency (%) |
| 54982353 | 18 | |
| 54982356 | 7 | 0.1% |
| 54982387 | 11 | |
| 54982463 | 2 | < 0.1% |
| 54982530 | 4 | < 0.1% |
| 54982549 | 10 | |
| 54982579 | 22 | |
| 54982665 | 4 | < 0.1% |
| 54982697 | 2 | < 0.1% |
| 54982721 | 10 |
| Value | Count | Frequency (%) |
| 59006239 | 3 | < 0.1% |
| 59006219 | 3 | < 0.1% |
| 59006193 | 8 | |
| 59006139 | 4 | |
| 59005995 | 3 | < 0.1% |
| 59005917 | 2 | < 0.1% |
| 59005881 | 3 | < 0.1% |
| 59005880 | 8 | |
| 59005871 | 5 | |
| 59005860 | 6 |
OVD_t1
Real number (ℝ)
High correlation  Zeros 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24909091 |
| Minimum | 0 |
|---|---|
| Maximum | 34 |
| Zeros | 7475 |
| Zeros (%) | 90.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 34 |
| Range | 34 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.2501966 |
|---|---|
| Coefficient of variation (CV) | 5.0190376 |
| Kurtosis | 203.50205 |
| Mean | 0.24909091 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.350079 |
| Sum | 2055 |
| Variance | 1.5629917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7475 | |
| 1 | 397 | 4.8% |
| 2 | 147 | 1.8% |
| 3 | 61 | 0.7% |
| 4 | 61 | 0.7% |
| 5 | 26 | 0.3% |
| 6 | 20 | 0.2% |
| 7 | 15 | 0.2% |
| 8 | 14 | 0.2% |
| 9 | 9 | 0.1% |
| Other values (11) | 25 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 7475 | |
| 1 | 397 | 4.8% |
| 2 | 147 | 1.8% |
| 3 | 61 | 0.7% |
| 4 | 61 | 0.7% |
| 5 | 26 | 0.3% |
| 6 | 20 | 0.2% |
| 7 | 15 | 0.2% |
| 8 | 14 | 0.2% |
| 9 | 9 | 0.1% |
| Value | Count | Frequency (%) |
| 34 | 1 | < 0.1% |
| 31 | 2 | < 0.1% |
| 23 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 3 | |
| 13 | 2 | < 0.1% |
| 12 | 4 | |
| 11 | 5 |
OVD_t2
Real number (ℝ)
High correlation  Zeros 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12715152 |
| Minimum | 0 |
|---|---|
| Maximum | 34 |
| Zeros | 7886 |
| Zeros (%) | 95.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 34 |
| Range | 34 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.86004635 |
|---|---|
| Coefficient of variation (CV) | 6.7639489 |
| Kurtosis | 406.08777 |
| Mean | 0.12715152 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.318031 |
| Sum | 1049 |
| Variance | 0.73967973 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7886 | |
| 2 | 127 | 1.5% |
| 1 | 111 | 1.3% |
| 3 | 43 | 0.5% |
| 4 | 31 | 0.4% |
| 5 | 12 | 0.1% |
| 6 | 9 | 0.1% |
| 7 | 9 | 0.1% |
| 9 | 7 | 0.1% |
| 10 | 4 | < 0.1% |
| Other values (6) | 11 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7886 | |
| 1 | 111 | 1.3% |
| 2 | 127 | 1.5% |
| 3 | 43 | 0.5% |
| 4 | 31 | 0.4% |
| 5 | 12 | 0.1% |
| 6 | 9 | 0.1% |
| 7 | 9 | 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 34 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 4 | |
| 10 | 4 | |
| 9 | 7 | |
| 8 | 3 | < 0.1% |
| 7 | 9 | |
| 6 | 9 |
OVD_t3
Real number (ℝ)
High correlation  Zeros 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.36921212 |
| Minimum | 0 |
|---|---|
| Maximum | 35 |
| Zeros | 7983 |
| Zeros (%) | 96.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 35 |
| Range | 35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.9003196 |
|---|---|
| Coefficient of variation (CV) | 7.8554289 |
| Kurtosis | 99.087751 |
| Mean | 0.36921212 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.6446005 |
| Sum | 3046 |
| Variance | 8.4118535 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7983 | |
| 1 | 46 | 0.6% |
| 2 | 35 | 0.4% |
| 3 | 22 | 0.3% |
| 35 | 15 | 0.2% |
| 6 | 14 | 0.2% |
| 5 | 13 | 0.2% |
| 34 | 12 | 0.1% |
| 4 | 12 | 0.1% |
| 9 | 12 | 0.1% |
| Other values (23) | 86 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 7983 | |
| 1 | 46 | 0.6% |
| 2 | 35 | 0.4% |
| 3 | 22 | 0.3% |
| 4 | 12 | 0.1% |
| 5 | 13 | 0.2% |
| 6 | 14 | 0.2% |
| 7 | 5 | 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 12 | 0.1% |
| Value | Count | Frequency (%) |
| 35 | 15 | |
| 34 | 12 | |
| 33 | 6 | 0.1% |
| 32 | 4 | < 0.1% |
| 31 | 3 | < 0.1% |
| 30 | 2 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 24 | 5 | 0.1% |
| 23 | 3 | < 0.1% |
OVD_sum
Real number (ℝ)
High correlation  Zeros 
| Distinct | 393 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 187.6817 |
| Minimum | 0 |
|---|---|
| Maximum | 31500 |
| Zeros | 7330 |
| Zeros (%) | 88.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 107.55 |
| Maximum | 31500 |
| Range | 31500 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1804.2326 |
|---|---|
| Coefficient of variation (CV) | 9.613258 |
| Kurtosis | 185.91802 |
| Mean | 187.6817 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.091835 |
| Sum | 1548374 |
| Variance | 3255255.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7330 | |
| 1 | 76 | 0.9% |
| 30 | 52 | 0.6% |
| 15 | 19 | 0.2% |
| 6 | 15 | 0.2% |
| 2 | 15 | 0.2% |
| 25 | 13 | 0.2% |
| 45 | 12 | 0.1% |
| 16 | 12 | 0.1% |
| 60 | 11 | 0.1% |
| Other values (383) | 695 | 8.4% |
| Value | Count | Frequency (%) |
| 0 | 7330 | |
| 1 | 76 | 0.9% |
| 2 | 15 | 0.2% |
| 3 | 8 | 0.1% |
| 4 | 6 | 0.1% |
| 5 | 7 | 0.1% |
| 6 | 15 | 0.2% |
| 7 | 10 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 31500 | 2 | |
| 31300 | 1 | |
| 30600 | 2 | |
| 30312 | 1 | |
| 29922 | 1 | |
| 29890 | 1 | |
| 29700 | 1 | |
| 29645 | 1 | |
| 28984 | 1 | |
| 28105 | 1 |
pay_normal
Real number (ℝ)
Zeros 
| Distinct | 37 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.526667 |
| Minimum | 0 |
|---|---|
| Maximum | 36 |
| Zeros | 285 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 11 |
| Q3 | 25 |
| 95-th percentile | 36 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 12.053627 |
|---|---|
| Coefficient of variation (CV) | 0.82975865 |
| Kurtosis | -1.0761803 |
| Mean | 14.526667 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.55839522 |
| Sum | 119845 |
| Variance | 145.28993 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 910 | 11.0% |
| 36 | 651 | 7.9% |
| 2 | 419 | 5.1% |
| 3 | 345 | 4.2% |
| 4 | 315 | 3.8% |
| 9 | 299 | 3.6% |
| 6 | 294 | 3.6% |
| 35 | 286 | 3.5% |
| 0 | 285 | 3.5% |
| 10 | 284 | 3.4% |
| Other values (27) | 4162 |
| Value | Count | Frequency (%) |
| 0 | 285 | 3.5% |
| 1 | 910 | |
| 2 | 419 | |
| 3 | 345 | 4.2% |
| 4 | 315 | 3.8% |
| 5 | 283 | 3.4% |
| 6 | 294 | 3.6% |
| 7 | 279 | 3.4% |
| 8 | 283 | 3.4% |
| 9 | 299 | 3.6% |
| Value | Count | Frequency (%) |
| 36 | 651 | |
| 35 | 286 | |
| 34 | 168 | 2.0% |
| 33 | 119 | 1.4% |
| 32 | 105 | 1.3% |
| 31 | 117 | 1.4% |
| 30 | 86 | 1.0% |
| 29 | 88 | 1.1% |
| 28 | 121 | 1.5% |
| 27 | 103 | 1.2% |
prod_code
Real number (ℝ)
Zeros 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.232 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 147 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 13 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.5330549 |
|---|---|
| Coefficient of variation (CV) | 0.42918548 |
| Kurtosis | 2.4727832 |
| Mean | 8.232 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.036071404 |
| Sum | 67914 |
| Variance | 12.482477 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 4523 | |
| 6 | 1144 | 13.9% |
| 5 | 962 | 11.7% |
| 1 | 427 | 5.2% |
| 13 | 425 | 5.2% |
| 2 | 239 | 2.9% |
| 0 | 147 | 1.8% |
| 7 | 147 | 1.8% |
| 12 | 56 | 0.7% |
| 19 | 35 | 0.4% |
| Other values (11) | 145 | 1.8% |
| Value | Count | Frequency (%) |
| 0 | 147 | 1.8% |
| 1 | 427 | 5.2% |
| 2 | 239 | 2.9% |
| 3 | 21 | 0.3% |
| 4 | 4 | < 0.1% |
| 5 | 962 | |
| 6 | 1144 | |
| 7 | 147 | 1.8% |
| 8 | 24 | 0.3% |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 3 | < 0.1% |
| 26 | 14 | 0.2% |
| 25 | 5 | 0.1% |
| 24 | 24 | 0.3% |
| 22 | 3 | < 0.1% |
| 19 | 35 | 0.4% |
| 17 | 22 | 0.3% |
| 15 | 22 | 0.3% |
| 13 | 425 | |
| 12 | 56 | 0.7% |
prod_limit
Real number (ℝ)
Missing 
| Distinct | 321 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 6118 |
| Missing (%) | 74.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85789.702 |
| Minimum | 1.1 |
|---|---|
| Maximum | 660000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 1.1 |
|---|---|
| 5-th percentile | 11000 |
| Q1 | 37400 |
| median | 68200 |
| Q3 | 112200 |
| 95-th percentile | 215847.5 |
| Maximum | 660000 |
| Range | 659998.9 |
| Interquartile range (IQR) | 74800 |
Descriptive statistics
| Standard deviation | 74345.828 |
|---|---|
| Coefficient of variation (CV) | 0.8666055 |
| Kurtosis | 11.844532 |
| Mean | 85789.702 |
| Median Absolute Deviation (MAD) | 36300 |
| Skewness | 2.6682966 |
| Sum | 1.8290365 × 108 |
| Variance | 5.5273022 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55000 | 103 | 1.2% |
| 11000 | 86 | 1.0% |
| 33000 | 51 | 0.6% |
| 27500 | 47 | 0.6% |
| 22000 | 46 | 0.6% |
| 82500 | 45 | 0.5% |
| 44000 | 44 | 0.5% |
| 16500 | 37 | 0.4% |
| 110000 | 37 | 0.4% |
| 66000 | 35 | 0.4% |
| Other values (311) | 1601 | 19.4% |
| (Missing) | 6118 |
| Value | Count | Frequency (%) |
| 1.1 | 1 | < 0.1% |
| 1100 | 2 | < 0.1% |
| 1650 | 3 | |
| 2090 | 2 | < 0.1% |
| 2200 | 5 | |
| 2750 | 3 | |
| 4070 | 1 | < 0.1% |
| 5500 | 6 | |
| 6050 | 1 | < 0.1% |
| 9240 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 660000 | 3 | |
| 566500 | 1 | < 0.1% |
| 550000 | 4 | |
| 543400 | 1 | < 0.1% |
| 526900 | 1 | < 0.1% |
| 481800 | 1 | < 0.1% |
| 468600 | 1 | < 0.1% |
| 458700 | 1 | < 0.1% |
| 412500 | 3 | |
| 410300 | 1 | < 0.1% |
update_date
Date
| Distinct | 3041 |
|---|---|
| Distinct (%) | 37.0% |
| Missing | 26 |
| Missing (%) | 0.3% |
| Memory size | 64.6 KiB |
| Minimum | 1988-07-19 00:00:00 |
|---|---|
| Maximum | 2016-05-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
new_balance
Real number (ℝ)
Skewed  Zeros 
| Distinct | 3939 |
|---|---|
| Distinct (%) | 47.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105404.2 |
| Minimum | -40303.2 |
|---|---|
| Maximum | 1.6321196 × 108 |
| Zeros | 3864 |
| Zeros (%) | 46.8% |
| Negative | 372 |
| Negative (%) | 4.5% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | -40303.2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 24948 |
| 95-th percentile | 313394.64 |
| Maximum | 1.6321196 × 108 |
| Range | 1.6325226 × 108 |
| Interquartile range (IQR) | 24948 |
Descriptive statistics
| Standard deviation | 1887704.1 |
|---|---|
| Coefficient of variation (CV) | 17.909193 |
| Kurtosis | 6770.8746 |
| Mean | 105404.2 |
| Median Absolute Deviation (MAD) | 76.2 |
| Skewness | 79.077382 |
| Sum | 8.6958464 × 108 |
| Variance | 3.5634269 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3864 | |
| -1.2 | 81 | 1.0% |
| -2.4 | 16 | 0.2% |
| -4.8 | 12 | 0.1% |
| -3.6 | 10 | 0.1% |
| 1.2 | 7 | 0.1% |
| 682.8 | 6 | 0.1% |
| 673.2 | 6 | 0.1% |
| 18000 | 6 | 0.1% |
| 6000 | 6 | 0.1% |
| Other values (3929) | 4236 |
| Value | Count | Frequency (%) |
| -40303.2 | 1 | |
| -32662.8 | 1 | |
| -25200 | 1 | |
| -22606.8 | 1 | |
| -14156.4 | 1 | |
| -13485.6 | 1 | |
| -12684 | 1 | |
| -12134.4 | 1 | |
| -11680.8 | 1 | |
| -11284.8 | 1 |
| Value | Count | Frequency (%) |
| 163211958 | 1 | |
| 32493420 | 1 | |
| 16800000 | 1 | |
| 14351192.4 | 1 | |
| 9619698 | 1 | |
| 9567460.8 | 1 | |
| 8421901.2 | 1 | |
| 8220792 | 1 | |
| 7937396.4 | 1 | |
| 6519902.4 | 1 |
highest_balance
Real number (ℝ)
Missing  Skewed 
| Distinct | 5140 |
|---|---|
| Distinct (%) | 65.6% |
| Missing | 409 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 219202.73 |
| Minimum | 501 |
|---|---|
| Maximum | 1.800005 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 64.6 KiB |
Quantile statistics
| Minimum | 501 |
|---|---|
| 5-th percentile | 4824 |
| Q1 | 23453 |
| median | 44047 |
| Q3 | 100500 |
| 95-th percentile | 531500 |
| Maximum | 1.800005 × 108 |
| Range | 1.8 × 108 |
| Interquartile range (IQR) | 77047 |
Descriptive statistics
| Standard deviation | 2814536.4 |
|---|---|
| Coefficient of variation (CV) | 12.839879 |
| Kurtosis | 2599.7929 |
| Mean | 219202.73 |
| Median Absolute Deviation (MAD) | 27475 |
| Skewness | 47.718634 |
| Sum | 1.7187686 × 109 |
| Variance | 7.9216152 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100500 | 151 | 1.8% |
| 150500 | 102 | 1.2% |
| 200500 | 84 | 1.0% |
| 30500 | 66 | 0.8% |
| 250500 | 54 | 0.7% |
| 300500 | 53 | 0.6% |
| 500500 | 53 | 0.6% |
| 50500 | 49 | 0.6% |
| 400500 | 42 | 0.5% |
| 40500 | 39 | 0.5% |
| Other values (5130) | 7148 | |
| (Missing) | 409 | 5.0% |
| Value | Count | Frequency (%) |
| 501 | 4 | |
| 502 | 1 | < 0.1% |
| 511 | 1 | < 0.1% |
| 514 | 1 | < 0.1% |
| 518 | 1 | < 0.1% |
| 550 | 1 | < 0.1% |
| 556 | 1 | < 0.1% |
| 581 | 1 | < 0.1% |
| 600 | 1 | < 0.1% |
| 601 | 8 |
| Value | Count | Frequency (%) |
| 180000500 | 1 | |
| 100000500 | 1 | |
| 95464005 | 1 | |
| 85000500 | 1 | |
| 35661277 | 1 | |
| 18000500 | 1 | |
| 14000500 | 1 | |
| 12000500 | 1 | |
| 10000500 | 1 | |
| 8580500 | 1 |
report_date
Date
Missing 
| Distinct | 1862 |
|---|---|
| Distinct (%) | 26.1% |
| Missing | 1114 |
| Missing (%) | 13.5% |
| Memory size | 64.6 KiB |
| Minimum | 1996-02-24 00:00:00 |
|---|---|
| Maximum | 2016-06-17 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Interactions
Correlations
| OVD_sum | OVD_t1 | OVD_t2 | OVD_t3 | highest_balance | id | new_balance | pay_normal | prod_code | prod_limit | |
|---|---|---|---|---|---|---|---|---|---|---|
| OVD_sum | 1.000 | 0.894 | 0.635 | 0.559 | 0.015 | -0.034 | -0.119 | 0.077 | 0.042 | -0.095 |
| OVD_t1 | 0.894 | 1.000 | 0.542 | 0.301 | 0.021 | -0.021 | -0.080 | 0.125 | 0.035 | -0.061 |
| OVD_t2 | 0.635 | 0.542 | 1.000 | 0.525 | 0.008 | -0.005 | -0.135 | 0.016 | 0.024 | -0.139 |
| OVD_t3 | 0.559 | 0.301 | 0.525 | 1.000 | -0.007 | -0.027 | -0.138 | -0.068 | 0.020 | -0.142 |
| highest_balance | 0.015 | 0.021 | 0.008 | -0.007 | 1.000 | -0.031 | 0.351 | 0.186 | -0.363 | 0.474 |
| id | -0.034 | -0.021 | -0.005 | -0.027 | -0.031 | 1.000 | 0.015 | 0.010 | 0.025 | 0.026 |
| new_balance | -0.119 | -0.080 | -0.135 | -0.138 | 0.351 | 0.015 | 1.000 | 0.102 | -0.097 | 0.327 |
| pay_normal | 0.077 | 0.125 | 0.016 | -0.068 | 0.186 | 0.010 | 0.102 | 1.000 | 0.119 | 0.022 |
| prod_code | 0.042 | 0.035 | 0.024 | 0.020 | -0.363 | 0.025 | -0.097 | 0.119 | 1.000 | 0.036 |
| prod_limit | -0.095 | -0.061 | -0.139 | -0.142 | 0.474 | 0.026 | 0.327 | 0.022 | 0.036 | 1.000 |
Missing values
Sample
| id | OVD_t1 | OVD_t2 | OVD_t3 | OVD_sum | pay_normal | prod_code | prod_limit | update_date | new_balance | highest_balance | report_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 58987402 | 0 | 0 | 0 | 0 | 1 | 10 | 16500.0 | 04/12/2016 | 0.0 | NaN | NaN |
| 1 | 58995151 | 0 | 0 | 0 | 0 | 1 | 5 | NaN | 04/12/2016 | 588720.0 | 491100.0 | NaN |
| 2 | 58997200 | 0 | 0 | 0 | 0 | 2 | 5 | NaN | 04/12/2016 | 840000.0 | 700500.0 | 22/04/2016 |
| 3 | 54988608 | 0 | 0 | 0 | 0 | 3 | 10 | 37400.0 | 03/12/2016 | 8425.2 | 7520.0 | 25/04/2016 |
| 4 | 54987763 | 0 | 0 | 0 | 0 | 2 | 10 | NaN | 03/12/2016 | 15147.6 | NaN | 26/04/2016 |
| 5 | 59004828 | 0 | 0 | 0 | 0 | 3 | 10 | 88000.0 | 02/12/2016 | 3196.8 | 6193.0 | 15/04/2016 |
| 6 | 58994429 | 0 | 0 | 0 | 0 | 2 | 10 | 16500.0 | 02/12/2016 | 3252.0 | 3210.0 | NaN |
| 7 | 54987756 | 0 | 0 | 0 | 0 | 2 | 1 | NaN | 02/12/2016 | 365331.6 | 304943.0 | NaN |
| 8 | 58988028 | 0 | 0 | 0 | 0 | 4 | 0 | NaN | 02/12/2016 | 16795.2 | 28500.0 | 19/04/2016 |
| 9 | 58993180 | 0 | 0 | 0 | 0 | 3 | 6 | NaN | 02/12/2016 | 26688.0 | 31300.0 | 20/03/2016 |
| id | OVD_t1 | OVD_t2 | OVD_t3 | OVD_sum | pay_normal | prod_code | prod_limit | update_date | new_balance | highest_balance | report_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8240 | 59003535 | 0 | 0 | 0 | 0 | 4 | 6 | NaN | NaN | 8332.8 | 11040.0 | NaN |
| 8241 | 54988244 | 0 | 0 | 0 | 0 | 2 | 13 | NaN | NaN | 46041.6 | 45499.0 | NaN |
| 8242 | 58998715 | 0 | 0 | 0 | 0 | 1 | 2 | NaN | NaN | 154012.8 | 128844.0 | 16/12/2015 |
| 8243 | 58998715 | 0 | 0 | 0 | 0 | 1 | 2 | NaN | NaN | 1448944.8 | 2415400.0 | 22/11/2015 |
| 8244 | 58999145 | 0 | 0 | 0 | 0 | 14 | 12 | NaN | NaN | 0.0 | 126500.0 | NaN |
| 8245 | 58995478 | 0 | 0 | 0 | 0 | 9 | 15 | NaN | NaN | 0.0 | NaN | NaN |
| 8246 | 54992408 | 0 | 0 | 0 | 0 | 1 | 2 | NaN | NaN | 0.0 | NaN | NaN |
| 8247 | 54988209 | 0 | 0 | 0 | 0 | 5 | 13 | NaN | NaN | 20654.4 | 33315.0 | NaN |
| 8248 | 54992408 | 0 | 0 | 0 | 0 | 1 | 2 | NaN | NaN | 0.0 | NaN | NaN |
| 8249 | 54989207 | 0 | 0 | 0 | 0 | 1 | 5 | NaN | NaN | 240000.0 | 200500.0 | NaN |
Duplicate rows
Most frequently occurring
| id | OVD_t1 | OVD_t2 | OVD_t3 | OVD_sum | pay_normal | prod_code | prod_limit | update_date | new_balance | highest_balance | report_date | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29 | 54992408 | 0 | 0 | 0 | 0 | 1 | 2 | NaN | NaN | 0.0 | NaN | NaN | 4 |
| 30 | 58982978 | 0 | 0 | 0 | 0 | 1 | 0 | NaN | 07/11/2011 | 0.0 | NaN | NaN | 3 |
| 49 | 58989600 | 0 | 0 | 0 | 0 | 4 | 15 | NaN | 26/04/2015 | 0.0 | 100500.0 | 26/04/2015 | 3 |
| 50 | 58989600 | 0 | 0 | 0 | 0 | 20 | 10 | 125400.0 | 28/03/2014 | 100155.6 | 121376.0 | 25/11/2015 | 3 |
| 51 | 58989600 | 0 | 0 | 0 | 0 | 36 | 10 | NaN | 12/08/2012 | 37899.6 | 59730.0 | 23/10/2015 | 3 |
| 0 | 54985410 | 1 | 0 | 0 | 6 | 4 | 10 | NaN | 08/11/2007 | 0.0 | 11676.0 | 26/02/2008 | 2 |
| 1 | 54986948 | 0 | 0 | 0 | 0 | 1 | 10 | NaN | 16/08/1997 | 0.0 | NaN | NaN | 2 |
| 2 | 54987336 | 0 | 0 | 6 | 5196 | 2 | 10 | NaN | 15/01/2002 | 0.0 | 91755.0 | 14/03/2014 | 2 |
| 3 | 54988645 | 0 | 0 | 0 | 0 | 2 | 6 | NaN | 10/05/2011 | 0.0 | 14500.0 | 22/06/2012 | 2 |
| 4 | 54989251 | 0 | 0 | 0 | 0 | 0 | 10 | NaN | 19/04/1997 | 0.0 | NaN | NaN | 2 |